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ABSTRACT 


Two established methods of code improvement, Day [4] and 
Kildall [7], are reviewed. The problems of optimal register 
allocation are discussed, A method is presented using 
Kildall's {7] optimization algorithm for specifying the 
active data items in ae program. Denonstraticn of 
particular problems with register allocation are presented. 
Topics for further consideration in a complete soluticn are 


discussed. 





Tapp or CONTENTS 


I. AN INVESTIGATION OF THE PARAMETERS FOR REGISTER 


ALLOCATION DURING COMPILATION ..c ccc cccccnnccccccce 
rt . BACKGROUND @®eeoaoeoewe?@¢eese@seesee@*esee@ee@e@eseeseeeseaPesesePseeeeveeteersesee@ee@sestseseseese ®@ 
A. CEITIMAL REGISTER ALLOCATION, DAY [4] ...-2 eng. 


B. GLOEFAL EXPRESSION OPTIMIZATION, KILDALL [7] 


ELIL. THE CONCEPT OF PROFIT cece ccccceesnvccaccccee 
IV. ACTIVE DATA ITEM EXTENSION w.cc ce eee ee wee ee ae 
V. REFERENCED DATA ITEM ANALYSIS cccccccccwcccee 
Vere ne mere Nit Hh ALLOCATION 2. cccccccccccvcscces 
Vere VO RMGrC UO EOS 6 6 os ce ween Cee cee ecw ee cece ees 
eeeuomeeOonnM OF REGISTER REASSIGNMENT ..... 
Pec eemrmneee QUATTON ANALYSIS ..2.ccecssccce 
Coe  DOUAL REGISTERS wcce cence seccaes 
De He Gdeeres PRELOADING 6. cc cease ce wee we ewes 

Ee ALTERATION OF THE COMPLEMENT EQUATION 
Cem eetre o LCS ne se eas sees eo ee eee eee eee 
Weltee CON CU Wem ans ss 5 cn se ee we we ces ew ce eee ens ceece 


“JI DHA OV 


15 
17 
tee, 
23 
23 
24 
24 
24 


26 
27 





iiol OF FIGuReS 


cc mmereOr AGI P YwehrI Re TERISTICS ..c2ccccceseses L1 





ACKNOWLEDGEMENTS 


The author wishes to extend personal and hearttrelt 
thanks to Gary A. Kildall whose efforts as a professor 
provided the interest in the subject area, whose efforts as 
an academician provide the structure for many of the ideas 
presented, and whose patience and time spent in review 
helped keep fantasies founded in reality. 

Also the author wishes to thank EnSign Gary Raetz who 
gave much tine as a student and laboratory partner and later 
as a second reader to many efforts which helped both 
directly and indirectly in the development of this thesis. 
In conjunction with Gary Raetz's time, thanks are also 
extended to his wife Mary for understanding his absences 


from her. 





at. BACKGROUND 


te ee ge ee ee ew ee ee 


The advent of higher order languages began the era of 
compilers and subseguently optimizers. The problem with 
compilers in general is that more efficient code can often 
be written by an assembly level programmer. Efficiency in 
this case is measured in the execution time for the prcgraa, 
the amount of memory reguired to store the program, or 
both. The desire for improved code lead to the develcpment 
of the science of code optimization. The faeld» of 
optimization has been explored [1,3,6,7,10] and several 
basic theories have evolved. 

One important aspect of optimization is register 
allocation. The register allocation problem is the froblenm 
of assigning data items to registers so that the resulting 
code is efficient. The extension or the problem is to 
coordinate register assignments made on different tranches 
and to determine data item replacement. Technigues will be 
presented for examining these two probienms. 

The discussions which follow will relate to a class of 
computers which are characterized by having a set of general 
purpose registers. The methods discussed would not apfly to 
Stack machines, for example. 

Data item allocation will be to one of a set of general 
purpose registers. The methods will then apply tc _ the 
allocation roblem by data item type. That is to say, the 
methods may te applied to allocating floating point data 
items toa set of floating point registers and integer data 
jtems to a set of integer registers as long as the data item 
type can be identified by the compiler. 

Several authors have addressed the problem of optimal 
BeoGaawetmeal location, either direct#y @©@ indireetly. Itewall 
be the purpose of this tretise to examine two of these 


techniques [Day 4,Kildall 7] with the purpose of determining 





the optigizing information which 1S necessary foc optimal 


allocation. 


A. OPTIMAL REGISTER ALLOCATION, DAY [4] 


One solution to optimal register allocation has been 
presented by Day [4]. Day's method 1S based primarily on 
the concepts of data item interference and profit. 

Day defines a data item aS a constant or a data name. 
"A data item is defined when statement execution causes a 
new value to become associated with the data itene™ 
Constants are defined by their representation and their 
values are not normally changed. "A data item is referred 
to when the current value of the data item is required for 
correct statement execution." Having the current value 
required for correct statement execution implies that the 
value is at least temporarily in a register. "A data iten 
i ACen pOIme an” a region “if it may bewsreferénced 
subsequent to that point." Day defines a region to be a 
Strongly connected subgraph of the program when represented 
as a directed graph. A strongly connected ‘subgraph, by 
definition, means that any node in the subgraph can be 
reached from any other node in the subgraph. Because of 
this characteristic of strong connectivity when ccmbined 


with the definition of an active data item, a data item is 


necessarily active over an entire region if it 1s active at 
any point in the region. Strongly connected tregions 


intuitively correspond to nested loop’ structures in the 


source progran. Day states that "two data items interfere 
in" a region "if they are both active at a point in" the 
region. Extension of the concept of active data items 


implies that any data items active in a regicn must 
necessarily interfere with any other active data item in the 
region at all points in the region. Since a region is 
strongly connected, all points in the region are necessarily 


subseguent to every other point in the region. 





The principal characteristic of interfering data items 
is that if they are allocated to the same register, at some 
polmt “shey Swill both be active, by the definiticn of 
iterference, and may not be allocated to the same register 
at that POM . Likewise, the characteristic of 
non-interfering data items is that at no point are any two 
non-interfering data items active. 

The concept of profit 1S a numerical “representation of 
the improvement in program execution that may occur if the 
data item is globally assigned to a register being 
processed." The comparative values of the profits are the 
deciding factors in making the assignment of a data item to 
a register. "The values assigned to the profit equation 
constants determine whether the profit represents a 
projected improvement in program size or execution time." 
Day assumes "the profit of a particular global assignment to 
be the sum of those data items therein assigned 120 
registers." In terms of the analysis, the method is to 
maximize over all possible asSignments to identify the 


asSignment with the largest profit value. 
A basic block is an ordered set of statements 


ows 2 -S3 4s«.,Sk} 

which is entered only through S1 and branched from only at 
Sk and where Si is executed before Sittl. Day uses this 
definition in the discussion of both lecal and global 
assignment. "Local and global assignment differ in the 
extent of the program over which the assignment of data 
items to registers is effective: local assignment cccurs 
Within a basic block, while global assignment occurs within 
a region." 

The desirability of global asSignment stems frem the 
weakness of the more easily conducted local assignment. Day 
states that one "weakness in local assignment involves the 
dispositicn cf data items that are defined or referred to in 


MEEIOGcK sang are active on entry towor exit from the khock. 





Local assignment cannot usually retain assignment history 
across kElock boundaries, and so the values of active data 
items must be moved to main storage for interblock transfers 
OLweoncror. 

Day introduces three types of allocation: cné€-one, 
many-~one, and many-few, "A Cne-one assignment defines a 
one-to-one correspondence between" the data items and _ the 
registers. A weaknesS in global one-one aSsSignment is that 
it is usually incapable of assigning more than one data iten 
to a register in a region. Day's approach to the solution 
of the problem "is to consider a set or data items for 
assignment to a register if no two data items in the set 
interfere at any point in the region." Day's global 
many-few assignment method has this characteristic. 
Many-few assignment is a Single valued mapping of a subset 
of the data items in a program onto a set of registers where 
the number of data items competing for aSSignment is greater 
than the number of available registers. 

Day presents a solution method for the global many-few 
proklen utilizing matrix construction and multiplication to 
implement the interference characteristics of the data 


items. 


B. GLOEFAL EXPRESSION OPTIMIZATION, KILDALL [7] 


Kildall conducts an analysis of prcegram structure in 
order to produce optimized object code. Kildall utilizes a 
directed graph to represent the program flow, along with an 
"optimizing pool," an “optimizing function," and a "meet 
operation" to conduct his analyses. 

An optimizing pool is associated with each node in the 
graph. The nature of the pool is an arbitrary set which 
describes the optimizing information asscciated with a 
particular node in terms of the analysis being conducted. 

An optimizing functicn maps an “input pool" and the 


optimizing pool of a node to a new "output" pool. In every 





instance, the input pool for a node is derived from the 
output fpcols for the node's immediate predecessors. The 
output pcol of a node contributes to the input pool of the 
node's immediate successor(s). 

The meet operation is defined to handle the problem of 
combining two or more input pools at a point where two or 
more program flows join, and varies for differing types of 
analysis. The meet operations defined are binary, 
associative, and commutative. The meet operation 1S a 
Mapping of the set of all optimizing fools onto itself. 
This can be represented as: 

PXP—>P 
(where P is the set of all optimizing fools). 

Kildall defines several types of analysis. Two of his 
analyses are of primary interest and will be reviewed 
below. The two are common subexpression eliminaticn and 
live variable analysis. 

For common subexpression analysis, the pool of computed 
expressions iS partitioned into equivalence classes whose 
members are known to nave identical values. The optirizing 
function for common Subexpression analysis manipulates the 
equivalence classes of the partition. "Two expressicns are 
placed into the same class of the partition if they are 
known to have equivalent values." The meet operation for 
common subexpression analysis is intersection by equivalence 
classes. 

For live variable analysis a reversed program flcw graph 
is used for the analysis. At any point, the pool associated 
with a node is the set of data items which may be referenced 
Subseguent (in the forward direction) to the node. The 
optimizing function for live variable analysis has two 
distinct characteristics. These are: 

"1. If the expression at node N involves an assignment 

to a variable, let d be the destination of the 
assignment; set P€&P-fej}d is a subexpression in 


e} (d and all expressions containing d become dead 


10 





expressions) " (e is the set of all partial 
computations at the current node.) 
"2. Consider each partial computation e at node N. Set 
EF<P f{e} The value of the optimizing functicn is 
altered to the value of P." 
The meet operation for live variable analysis 1s set 
Suga ol ye 


For completeness, Kildall‘s flow analysis algorithm is 


presented below.. The following notation will be used in 
the presentation: P is the set of all possible optimizing 
pools. iB is an entry pool set. 1 is the unit element 


for the analysis being conducted. 
Alf initialize ] Moa 8 


A2{ terminate? ] If L=9 then HALT 


A3[select node} Let L'éL,L'=(N,P1i) for some Né€N and 
Pi€P, L€L-{L'} 


Auf traverse? ] Let Pn be the current approximate fool 
of optimizing informaticn associated 
with the node N (initially Pn=1). If 
Pn€Pi Go To step A2. 


A5[ set pool] Pn©PnAPi, L#LU{N', £(N,Pn)) | N'ET (N)} 
A6[ loop ] Go To step A2. 


Examples of optimizing pools, an optimizing function, 
and a meet operation are presented in secticn V. The tern 
giobal will be used henceforth to refer to an entire program 
and not just a region. 

By utilizing Kiidall's methods, the data item ccncept 
can be expanded to include expressions, This extension is 
desirable bécause a repeated expression would have to be 


recomputed ie a allocation were only to variakles and 


11 





constants. Kildall [7] presents a data structure which may 
be used for manipulating the data items under the expanded 


definition, 


liz 





III. HE CONCEPT OF PROFIT 


The concept of profit is essential to register 
allocation. Day's definition of profit is a linear 
comkinaticn of the number of definitions of and references 
to a data item in a region. 

The expanded definition of data item may mean that the 
number of references may not accurately and completely 
reflect the value of a data item (an expression, for 
example) in a register. It will not be the purpose of this 
paper to specify an explicit profit function. However, the 
contributing factors of the profit function under the 
expanded data item definition will be discussed below. 

In general, it is assumed that the profit should reflect 
a measurable guantity. If the optimization is to be toward 
program size, the profit function should be a meaSure cf the 
relative number of instructionS required to execute the 
resulting code. If the optimization is to be toward frogran 
run time, the profit function should be a measure cf the 
execution time of the resulting instructions. The two 
concepts, of course, are often closely related. 

The profit should increase with the number of references 
to a data item over an active region. If a alea EEO rat 
results from this factor, it would imply a decrease in the 
number of load operations (and combining operations in the 
case of expressions). This factor would then imply a 
decrease in program size and, depending on the operation 
times, often leads to Savings in run time. 

The Eprorit should increase WIth the number of 
instructions necessary to replace the data item in a 


register, due to the fact that the data item concept is 


extended to include arbitrary expressions. This factcr 1S 
called "complexity," since the profit is related to the 
complexity of the data item. TO sreGiCce Eun ‘tine, profit 
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should assign different values to the operations as to 
execution time. The slower execution times of a particular 
operation wculd lead to a higher profit Since it would 
require more time to replace the data item in a regiSter. 
Exponentiaticn would be weighted more heavily than addition, 
for example. 

The profit should decrease with increased distance to 
the next reference, thus preventing highly complex data 
items from holding a register over long program flows 
without reference. 

The PEOnIe Should be adjusted with program flow 
infcrmation, when available. Logically, a data item on a 
highly executed branch would have higher valve in a register 


than a Similar data item on a seldom executed branch. 
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IV. ACTIVE DATA ITEM EXTENSION 


—_— se = 


One of the primary attributes of Day's analysis deals 
with the concept of interfering data items. A problen with 
Day's definition of an active data item when combined with 
his definiticn of a region was mentioned above. It becomes 
desirable to make a new definition of an active data item to 
correct that problem. Intuitively, a data item 1s active 
between a definition of the data item and the last reference 
to the value of the data item which was thereby defined. [In 
terms of register allocation, this definition may be stated 
as: a‘ data item 1s active at all points where the value 
of the data item must exist or have existed in a register 
for proper statement execution and remains active to the 
last reference to the data item for which the valve which 
existed in a register would yield correct execution. 
Infcrmally, in terms of registers, a variable becomes active 
when the associated value exists in a register and remains 
active over the range to the last point at which it is 
referenced prior to redefinition or program termination. [In 
other words, it is the range over which a data iten 
maintains the value which was at one time loaded into a 
register. 

To an extent, live variable analysis corresponds tc this 
definition. Variables are included as being live over the 
range from which they are assigned a value by an executable 
Statement to the point of their last reference frior to 
redefinition (by an executable statement) Or program 
termination. Live variable analysis departs from the 
definition given for active data items in two ways. 

The first departure of live variables from active 
variaktles comes from the case of data items which are 
implicitly defined. Implicit definition may be made Ly the 


representation (in the case of constants), by compile-time 
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assignments (e.g., the FORTRAN "DATA" statement), or by 
default memory initialization. Implicitily defined data 
items are evaluated as live from program entry to the last 
reference to the data item (with possible non-active 
sections interspersed). For implicitly defined data items, 
however, the value associated with the data item does not 
exist in a register until the first reference. Implicitly 
defined data items are, therefore, active frem their first 
reference to their last reference or redefinition. 

The second departure of ilive variables from active 
variables may occur from a READ-type statement. Depending 
on the machine configuration and the data manipulaticn for a 
READ-type statement, the data item read may or may not have 
existed ina usable form in a_ register. Live variables 
begin a live segment with the definition by a REAL-type 
Statement. Tepending on the data manipulation, a READ 
statement may or may not originate an active program segment 


for that data item. 
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V. REFERENCED DATA ITEM ANALYSIS 


= — me —_— Ss = = — ee ee 


In order to analyze data item interference based on the 
revised definition of active data items, referenced data 
item analysis is introduced. The purpose of referenced 
data item analysis iS to provide a method which determines 
data items possessing the characteristics of active data 
items nct possessed by live variables so that the active 
data items may be determined. In particular, referenced 
data item analysis produces sets of data items which have 
previously appeared in a register. 

The optimization pool for referenced data item analysis 
is the set of all data items which have been referenced 
previous to the current point in the program flow. The 
optimizing function performs a union of all data items in 
the expression at the current node with the input set of 
referenced data items. Thus for an expression R=A+E ata 


node N with an input pool of {X,Y,2Z} 


F(N, {X,Y,2Z})={X,Y,2} U {R,A, E}={R,A,B, X,Y, 2} 
where F(N,Pn) is the optimizing function operating on node N 
and the ccrresponding input pool Pn. 

Active data items are not dependent upon the progran 
branch structure, An active data item 1S active from a 
first reference (loaded into a register) to the final 
reference, with possible inactive segments interspersed. 
The meet cperation is, therefore, set union. 

As discussed above, the inclusion of a variable in the 
referenced data fools which participate in a REALI-type 
Statement will be dependent upon the machine for which the 
output code is intended. 

As discussed in section IV, live variables have the 
characteristics of active variables with the excepticn that 


their pcint cf entry into the set of active data items may 
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overextend the definition point. Referenced data items have 
the characteristics of active data items except that the 
data items may (and in general will) extend past the last 
reference. Intersection of pools for these two analyses at 
every point in the program flow, then, will produce sets of 
data items which are active at that point. It shculd be 
noted that a single forward pass is insufficient for active 
variable analysis because on a forward pass the current 
reference to a variable is not Known to be the last. 
Similarly, a single reverse pass is insufficient because the 


current reference is not known to be the first reference. 
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VI. A PROBLEM WITH ALLOCATION 


The problem of register allocation 1S now considered. 
There are certain necessary requirements which are implicit 
because of the interference characteristics of the 
variables. The nature of these necessary requirements is 
that no two mutually active variables may be assigned to the 
same register. There are some desired characteristics 
which are derived from the branching structure of the 
program. The nature of the deSired characteristics is that 
variables active on several branches be in the same register 
at the point where the branches join. | The necessary and 
the desired characteristics will now be discussed. 

At any point in the program, only one data item may be 
allocated to a register. Day {4%} allocates a set of 
non~-interfering data items to a register. The 
characteristics of non-interfering data items, however, 
imply that only one data item at any given point will have 
value in the register (be active at that point). fTherefcre, 
although Day allocates a set of data items, at any point 
only one memker of that set will be ina register. 

Further, all active data items have value in a 
register. If the number of active data items is greater 
than the number of registers, then a Selection must be made 
of the data items to be allocated. PEO Eales”. 4S the 
measurement used to make the selection. The selecticn may 
be made Ey reducing the set of active data items at each 
node to the M most profitable data items, where M is the 
number of registers. The necessary reguirements will be 
derived from these reduced active data item fools. That is, 
no two members of any reduced active pool at any point may 
be allocated to the same register at that point. 

It is desirable that data items allocatcd on different 


branches and are active at a point where the branches join, 
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be allocated to the same register on each branch. By 
meeting this desired characteristic, the register will hold 
the correct value of the data item regardless of the tEranch 
taken to reach the point where they join. The desired 
Characteristics apply only in the event the data item is 
active cn all branches which join at a point. If the data 
item was not active just prior to the join point, the 
contents of a register would be dependent upon the Eranch 
taken at run time. 

Figure JI is now presented to illustrate the desired and 
the necessary characteristics of a program Segment. There 
are three variables in the example, X,Y and U. For purposes 
of the example, Ra(d) and Rbd(d) will represent the register 
allocated to the data item represented by d cn branch a and 
b respectively. R1 and R2 will represent the actual 


registers in the two register machine. 


branch a branch b 


FLEGURE 2. 
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Mond U akendGtive prior to point 1 amd ower a1 o4 
branch a. X 1s active on branch b to point 2 and fron 
Peamt ss tO pount 4. Y¥Y is active on branch b from point 1 to 
point 3. U is active on branch b from point 2 to point 4. 

The necessary requirements in the example are shown 
belcw. 

1 Ra (X) #Ra (U) 
2 Rb(X) #Rb(Y) 
3 Rb(U) #Rb(Y) 
4 Rb(U) #Rb(X) 


The desired characteristics in the example are: 


1 Ra (X) =Rb (X) 
2 Ra(U) =Rb (UV) 

Another set of constraints stems from the activity 
Characteristics of the variables in conjunction with the 
hecessary requirements. At Powe 1, for example, 
Rb(X)#RD(Y) and at point 2 Rb(U)#RD(Y). Since there are 
only two registers in the machine, these requirements 
combine tc imply that RD(X)=Rb(U). 

More specifically, the register released by a data item 
when it becomes inactive 1S subseguently used by another 
data item when it becomes active. Thus the register used by 
the first item is "eguated to" the register of the second 
data iten. Due to the fact that the activity of the data 
items must net conflict, this action is termed “conplement 
equation." 

The ccmplement equation characteristics cf the eéxanple 


are: 


1 Rb (U) =Rb(Y) 

2 Rb(X)=Rb(U) 

3 Rb(Y) =Rb(X) 
The complement equation characteristics, the necessary 
requirements, and the desired characteristics combine to 
reach a contradiciton. The contradiction may be represented 


DY : 


ZO 








Ra (X) =R1 Arbitrary 
Ra (U) =R2 Necessary Reguirement 1 
Rb (Y) =R2 Complement Equation 1 
Rb (X) =R1 Necessary Requirement 2 
Rb (U) =R1 Complement Equation 2 
Rb (X) =R2 Necessary Requirement 4 
Rb (X) =R1 Desired Characteristic 1 
Note that the last two assignments are in ccenflict. 
Arbitrarily setting Ra(X)=R2 would lead to the same 
contradiction. 
The effect of this contradiction is that, if an 
alteration is not made to at least one of the 
characteristics, a reference to X or U after point 4 would 


reguire reloading of the variable being referenced. 
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locally using globally derived information, using Kildall's 
techniques [7}. 


B. COPPLEMENT EQUATION ANALYSIS 


The ccncept of complement equation was introduced in 
section VI. The projected purpose of complement equation 
analysis 1S to specify the complement equation constraints 
of a program. Complement equation, being based cn the 
activity characteristics of the data items, should oferate 
on the sets of active variables. By comparing the focls of 
active data items from node to node, the changes in the 
pools represent the registers of the data items which are 


complement eguated. 


C. DESIRABLY EQUAL REGISTERS 


Desirably equal register assignments occur where tho or 
more pregran flows join. The program characteristic which 
leads to desired register equation is as follows: registers 
of data items are desirably equal if they are active cn two 
or more Eranches prior to a join point, and are active at 
the join point, Specification of this Situation may be made 
by intersecting the active pools prior to the join fpoint 
with the active pool at the join point. Data items which 
are in the intersection are then desirably in the same 


register cn all branches. 


D. REGISTER PRELOADING 


Preloading is the process of loading a register frior to 
ae gOin point. Preloading may be desirable aen two 
Situations. 

The first situation arises from the expanded definition 
of data items to include expressions. The situations ray be 


detected by comparing the complexity of the comnon 
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Subexpression pools from pre-join nodes to the join node. 
If the complexity of the pool increases, the preloading may 


be profitable. For example, if the pool structure is 
{X, A+B} {Y¥, A+B} 


{A+ B} 
it may be worthwhile forcing AtB to a register at the join 


point. Tc load AtB at the join point would require 


LOD A 

ADD B 
However, pricr to the join point, A+B may te loaded into a 
register by loading either X or Y requiring cnly one 
operation. 

The second situation for which preloading may be of 
value occurs when several branches join and a data item is 
active cn several, but not ali of the branches joining at 
that point. By preloading the data item on the branches on 
which it iS not active, the code on the kLranches say be 
fully utilized. Thus the code 


LOD A LOD A g 
ADD B ADD B 
STO X STO X 
LOD A 
ADD B 
STO Y 
could be modified to 
LOD A LOD A LOD 
ADD B ADD B ADD B 
STO xX STO xX 
SiO7 -X 





in which case the correct value of A+B would ke in a 
register and could be stored at the join point. The 
resulting ccde in this case would have the same progran 
size, but wculd execute in a shorter time, especially if the 


Light branch were seldom executed. 


E. ALTERATION OF THE COMPLEMENT EQUATION CHARACTERISTICS 


The complement equation characteristics are the 
constraints which must be relaxed. That is, given that a 
contradiction exists, the necessary characteristicS cannot 
be changed, and while the desirably equal registers 
constraints may be altered, this could lead to excessive 
load-store operations. The opportunities for altering 
these charcteristics exist in three forms. If there is a 
node~to-nodée change in the active pools of two or mcre data 
items, the newly active data items may be loaded intc any of 
the vacated registers. If there are fewer than M active 
data items at a node (where M is the number of registers), 
then the complement equation characteristics may be altered 
by perfcrming a register-to-register move. The thie 
alteration made be made at any point by storing the current 
register contents to a temporary location, performing a 
register-to~-register movement, and loading the vacated 
register from the temporary location. 

The purpose Cpe altering the complement equation 
characteristics is to satisfy the desirably equal reaister 
constraints. When the alterations are made at a cost, such 
as storing, performing a register-to-register move, and 
loading, the cost would have to be balanced against the 
advantages gained by satisfying the desiraktly equal recister 


characteristics. 
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X. CONGTUSTONS 


ee ee ee es ee 


Using Kildall's {7] algorithm, a method was presented 
for specifying the active data items. The ~conce peso F 
reducing the active data items at each node to the M most 
profitable (where M 1s the number of regiSters) was 
mit roduced. A description of the nature of the desired and 
necessary characteristics of allocation was fresented. 

The existence of the contradictions specified in 
section VI implies that there may not be a satisfactory 
solution to every register allocation problem. If there is 
ho universally satisfactory solution, the problem then 
becomes a linéar programming problem. AS in Day's solution, 


the problem may be informally stated as: 


MAXIMIZE: Profit 
SUBJECT TO: 1. Necessary Requirements 


2. (Desired Characteristics) ' 


where (Desired Characteristics)' may be a proper sukset of 
the desired characteristics of the program. Profit in this 
case is the sum of the profits of the data items assigned to 
registers and the equating profits less the equating ccsts. 
Maximizing the profit of the variables at each node will 
ensure that the profit associated with the data items is 
maXimum. The natural extension would imply that maxiazizing 
the profit of equating at each step would also result in a 
maximum profit globally. This may not ke true, however, 
Since changes to the complement equation characteristics, 
when made, are effective over all nodes subseguent to the 
node at which the alteration is made. Thus an alteration to 
gain a desired register eguating may require other 
aqiteraticns in the complement equation characteristics Which 


Wor have an associated cost. None of these statements 


2] 





have been formally specified, however, and remain as topics 
for further investigation. In the final amalys@s, it 
appears that the techniques discussed here must be applied 
somewhat heuristically in an attempt to obtain a "good!" 
allocation. This allocation may be incrementally improved 
but, considering the current state of the theory, no 


absolute statements are possible at this time. 
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